ABS Teacher Resource SPREADSHEETS 


TAKING REAL DATA 
FURTHER MATHEMATICALLY 


Andrew Stewart 


Introduction 

A good definition of statistics is ‘numerical detective work’. Just like any other 
form of detective work, to be successful at statistics you need to practice. This 
exercise provides practice in a range of techniques involved in statistical work 
and is written around using a spreadsheet (Excel). 


Instructions for completing an activity in the spreadsheet are given in a box 
like this one. 


CensusAtSchool data 

In 2008, over 40 000 students around Australia completed an online 
questionnaire, containing about 40 questions, on the ABS website. Sample sets 
of responses can be downloaded for analysis. 


The responses of 100 students selected at random from the full data set has 
been used to set up this activity. 


In this activity, we will look at analyzing two sets of categorical data - and how 
students in years 11 - 12 can complete an analysis. 


The Spreadsheet 


Double click on the icon of the spreadsheet to open it. | 


The key features of the CatCat page are a pair of lists on the left side, two two- 
way frequency tables and two segmented bar charts. On this sheet we are 
looking at two categorical variables. 
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The two variables on display are Sex (gender of responding student) and 
UseComp (do you use a computer at home). 

The data is divided into the four possible combinations - Male/Use computer 
Yes, Male/Use computer No, Female/Use computer Yes and Female /Use 


computer No. 

UseComp 
| ves | 96 | a5 | 
| wo | a | as | 


We can now see that, although there are the same number of males and 
females who use a computer at home (45), because of the slightly smaller 
number of females in the sample compared to males, this number represents 
96% of females sampled compared to 85% of males. Percentages are the best 
way to display this data - and percentages by columns (usually representing a 
key variable such as gender) are the accepted way to calculate the 
percentages. This makes the construction of the segmented bar chart quite 
easy. 


Sex 


UseComp 


Yes 


pas | as | 00 | 
Pakee 


Total 


Click on the UP arrow on the spinner to select Set 2 (USeGame)(do you use a 
computer game at home). The charts and tables will change to reflect the new 
data, and new numbers and percentages are calculated. 


Complete the following two-way frequency tables, putting number values in the 


left table and percentages in the right table : 
vs | | | | | ves | 
a ee 


Comment briefly on these results, especially with respect game use by each 
gender. 
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Click on the UP arrow of the spinner to select Set 3 (Hand)(what is your 
dominant hand). The charts and tables will change to reflect the new data, and 
new numbers and percentages are calculated. 


Complete the following two-way frequency tables, putting number values in the 


left table and percentages in the right table : 


Right handed 


Right handed (nae a 
Ambidextrous| | 


[i 
ftefthanded | | | 

ambidextrous) || 
ee 

Comment briefly on these results, especially with respect to the dominant hand 


for each gender. 


How do these figures compare with the percentages of right handed, left 
handed and ambidextrous people in the population as a whole ? 
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Click on the UP arrow of the spinner to select Set 4 (ColrEye)(eye colour). The 
charts and tables will change to reflect the new data, and new numbers and 
percentages are calculated. 


Complete the following two-way frequency tables, putting the correct number 
values in the left table and correct percentages in the right table : 


Colrye 


Sex 


Colrtye 


Comment briefly on these results, especially with respect to the predominant 
eye colour for each gender. 


We can see that Brown is the predominant colour for males and Blue the 
predominant colour for females - both with around 40% of their respective 
gender. 


How do these percentages compare with those for the population as a whole 
for people with eyes of various colours ? 


However, we notice that because there are a larger number of categories, the 
number of responses per category (on average) is dropping. With larger 
numbers of categories, the number of responses available per category for a 
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100 student sample will be diminished. We will see how this factor affects our 
analyses as we look at questions giving rise to larger numbers of categories. 
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Click on the UP arrow of the spinner to select Set 5 (TrvToScl)(how you 
normally travel to school). The charts and tables will change to reflect the new 
data, and new numbers and percentages are calculated. 


Complete the following two-way frequency tables, putting the correct number 
values in the left table and correct percentages in the right table : 


Sex 


TrvToSl 
; 


So tte Ie sis: 
| ce | a | 5s | 6 
| otrer | 1 | o | a 


We can see that Car, Walk and Bus are the three most popular forms of 
transport for this sample, but the order differs between males (Car, Walk, Bus) 
and females (Car, Bus, Walk). Cycle, once a popular way for students to travel 
is now well down, with more males (9%) cycling than females (2%). Two 
categories had zero responses, and with this number of categories for this 
sample size, this occurrence was always likely. 
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Click on the UP arrow of the spinner to select Set 6 (WhreLive)(which state or 
territory do you live in). The charts and tables will change to reflect the new 
data, and new numbers and percentages are calculated. 


Complete the following two-way frequency tables, putting the correct number 
values in the left table and correct percentages in the right table : 


Male 


Sex 


al 1 


oe ee ee) 


| Male 
nsw | | 
ae es 
/ oid / ow | | | 
| sa | 6 | 
as ier || Ses 
| vic | | 
powa | | 


There is certainly a distinct difference between the genders for state (or 
territory) of residence. 


For females, the order (of popularity) is Vic and NSW (both 26%), Qid, WA, SA 
and Tas (both 6%) and NT with none. 


For males, the order is NSW, SA, WA, Vic, Qld, NT and Tas (both 2%). 


How do these percentages compare with those for the population as a whole 
for which state people live in ? 


A number of categories had very low or zero responses, and with this number 
of categories for this sample size, this occurrence was always likely. 
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Click on the UP arrow of the spinner to select Set 7 (YrLevel)(what year level 
are you at school). The charts and tables will change to reflect the new data, 
and new numbers and percentages are calculated. 


Complete the following two-way frequency tables, putting the correct number 
values in the left table and correct percentages in the right table : 


Male 
Year 4 or 

| years | 9 | 

| years | 

| year? | 

| years | 

| yearo | 
pod 
p62) 
eel 


Sex 


ae 
Year 4 or 
below 2 
| years | 
| year? | 
years | | 


| vearui | 3 
| veari2 | 1 
1 | | other | 


There is certainly a distinct difference between the genders for state (or 
territory) of residence. 


pa | | years | 


4 
3 
1 


2 
53 100 


For females, the order of popularity of school year is 6, 7 and 9 (both 17%), 8, 
10, 5, 6, 4, 12 and other (none). 


For males, the order is 8, 6, 4, 10, 7, 5, 9, other, 11 and 12 (both O). 


A number of categories had very low or zero responses, and with this number 
of categories for this sample size, this occurrence was always very likely. 
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Click repeatedly on the DOWN arrow of the spinner to select Set 8 (BrthMnth). 
The charts and tables will change to reflect the new data, and new numbers 
and percentages are calculated. 


Complete the following two-way frequency tables, putting number values in the 
left table and percentages in the right table : 


Sex 


Male 


ex 


Female Male 


There is certainly a distinct difference between the genders for month of birth. 


For females, the order of frequency of birth month is July, Feb and June, May 
and Sept and Oct and Dec, Aug and Nov, Jan and Mar and Apr.. 


For males, the frequency order is Apr, Jan, June and Aug and Oct, Mar and July 
and Sept and Nov, Feb and Dec, May. 


A number of categories had very low or similar numbers of responses, and with 
this number of categories for this sample size, this occurrence was always very 
likely. 


What are the best numbers for categorical - categorical analysis ? 
Two columns and no more than four rows. 
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